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Abstract. Every positive rational number has representations as Egyptian 
fractions (sums of reciprocals of distinct positive integers) with arbitrarily 
many terms and with arbitrarily large denominators. However, such repre- 
sentations normally use a very sparse subset of the positive integers up to 
the largest denominator. We show that for every positive rational there exist 
representations as Egyptian fractions whose largest denominator is at most N 
and whose denominators form a positive proportion of the integers up to N, 
for sufficiently large N; furthermore, the proportion is within a small factor of 
best possible. 



1. Introduction 

The ancient Egyptians wrote positive rational numbers as sums of distinct re- 
ciprocals of positive integers, or unit fractions. In 1202, Fibonacci published an 
algorithm (subsequently rediscovered by Sylvester in 1880, among others) for con- 
structing such representations, which have come to be called Egyptian fractions, 
for any positive rational number. Since that time, number theorists have been 
interested in some quantitative aspects of Egyptian fraction representations. For 
instance, there are algorithms that improve upon the Fibonacci-Sylvester algo- 
rithm in various ways, bounding the size of the largest denominator or limiting the 
number of terms. Bleicher Q| has a thorough survey of, and references to, such 
developments. 

One line of questions concentrates on the number of terms in Egyptian fraction 
representations. A positive rational m/n can always be expanded into an Egyptian 
fraction with at most m terms, for instance by the Farey series algorithm (see 
Golomb H). Erdos and Straus conjectured that for all integers n > 2, the fraction 
4/n can actually be written as the sum of three unit fractions rather than four. 
(In their conjecture, distinctness of the terms is not required.) Sierpihski || made 
the same conjecture for fractions of the form 5/n, and mentioned that Schinzel 
conjectured that for any fixed numerator m, the fraction m/n could be written as 
the sum of three unit fractions for sufficiently large n. In this vein, Vaughan [[?] 
showed that almost all positive fractions with numerator m can be written as the 
sum of three unit fractions. 

One can also investigate the behavior of the number of terms in Egyptian frac- 
tions at the other extreme. Any unit fraction can be split into two using the identity 
l/rt = l/(n+l) + l/(n(n+ 1)); consequently, given a particular Egyptian fraction 
representation, one can repeatedly use this splitting algorithm on the term with 
largest denominator to construct such representations with arbitrarily many terms. 
However, this process results in a tremendously thin set of denominators in the 
sense that if the largest denominator is x, then the number of denominators is 
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<C log log a;. One can try to use the splitting algorithm on the intermediate terms, 
but then the danger arises that the resulting unit fractions will no longer be distinct. 

It is therefore interesting to ask how many terms can be used to represent a 
rational number as an Egyptian fraction, given a bound on the size of the largest 
denominator. The purpose of this paper is to demonstrate that a positive pro- 
portion of the integers up to the bound can in fact be assembled to form such a 
representation. We will establish the following theorem: 

Theorem 1. Let r be a positive rational number and n > a real number. For 
any real number x that is sufficiently large in terms of r and r\, there is a set S of 
positive integers not exceeding x such that r = ^2 neS 1/n and \S\ > (C(r) — T))x, 
where 

C W = (l-log2)(l-e xp ( T -^)). 



That one can always find such a dense set of denominators is already surprising 
(though the Egyptians might not have been pleased to do their arithmetic this 
way!), but it turns out that the proportion given in Theorem [l] is comparable to 
the best possible result. Given a positive rational number r, any set S of positive 
integers not exceeding x with cardinality [(1 — e~ r )x\ satisfies 

J3Vn> l/n = r + O r {x- 1 ). (1) 

neS e~ r x+Kn<x 

Consequently, the best possible value for C(r) in Theorem |] would be 1 — e~ r . 
Notice that C(r)/(1 — e~ r ) = 1 — 0(r) as r tends to zero, and that 

JZfcL _ (1 _ loe2) l-exp(- r /(l -lo S 2)) > , _ _ 30685 
1 — e r 1 — e r 

Thus when r is small, the value for C(r) given in Theorem [l] is very nearly best 
possible, and in any case it is never smaller than 30% of the best possible value. 

While the proof of Theorem [l] involves a rather complex notation, the idea un- 
derlying the construction is quite straightforward. One begins by subtracting from 
r the reciprocals of a suitable set A of integers not exceeding x with cardinality at 
least (C(r) — rj)x. For a given prime p dividing the denominator of this difference, 
we can add back in the reciprocals of a very few members of A so that the factors 
of p in the denominator are cancelled out. If we repeat this process for all large 
primes, we will have expressed r as the sum of the reciprocals of almost all of the 
elements of A, plus a small rational number whose denominator is only divisible 
by small primes. We are then able to write this small rational number as the sum 
of reciprocals of distinct integers much smaller than the members of A, using the 
idea outlined above combined with an existing algorithm for expanding rational 
numbers into Egyptian fractions. 

For a given prime p and rational number r whose denominator is a multiple of p, 
we might have to add the reciprocals of as many as p — 1 multiples of p to eliminate 
the factor of p from the denominator of r, as shown in Lemmas |^ and ^. Since all 
of the elements of A are at most x in size, we must have p(j>— 1) < x\ thus we must 
restrict our attention to integers that are roughly x 1 / 2 -smooth. Fortunately, the 
distribution of smooth numbers has been widely studied. For instance, the density 
of x 1 / 2 -smooth integers of size x is 1 — log 2 (see Section ||); this is the origin of 
such factors in the expression for C(r). 
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Moreover, it turns out that we must consider integers that are divisible by differ- 
ent powers of p separately, which forces us to have at least p — 1 multiples of p 2 in 
our set A, at least p— 1 multiples of p 3 , etc. Thus we must restrict to z 1 / 2 -smooth 
integers that are squarefree with respect to primes exceeding a; 1 / 3 , cubefree with 
respect to primes exceeding x 1 / 4 , and so on. We shall find that the extra conditions 
on the multiplicity of their prime factors is easily handled. For simplicity we shall 
work with, roughly, x^-smooth numbers that are k-hee (not divisible by the kth 
power of any prime) and that are squarefree with respect to primes exceeding a; 1 '*, 
for some integer k > 2. Such integers satisfy all the above constraints, and we 
show in Lemma || that the multiples of each prime power among such integers are 
sufficiently plentiful for our argument to go through. 

We define log! x — maxjlogx, 1} and log.,- x = \og 1 (log^ x) for any integer j > 
2, and we write log fe x and log^ x as shorthand for (loga:) fc and (log.,- x) k respectively. 
We use the notations P(n) and p(n) to denote the greatest and least prime factors 
of n respectively, and make the conventions that -P(l) = 1 and p(l) = oo. We say 
that a prime power p l exactly divides an integer n, or that n is exactly divisible 
by p l , if p l divides n but p l+1 does not. The constants implicit in the <C and O- 
notations in this paper may depend on any Greek variable (a, 6, e, r\, and A) and 
also on k and the rational number r or a/b where appropriate, but they will never 
depend on p, q, t, v, w, x, or y; this remains the case when any of these variables 
is adorned with primes or subscripts. When the phrase "x is sufficiently large" is 
used, the size of x may depend on the Greek variables and fc, r, and a/b as well. 

The author would like to thank Hugh Montgomery and Trevor Wooley for their 
guidance and support and the referee for many valuable comments. This mate- 
rial is based upon work supported under a National Science Foundation Graduate 
Research Fellowship. 



2. Elementary Lemmas 

The following lemma is an easy consequence of the Cauchy-Davenport Theorem 
(see H Lemma 2.14], for instance), but we provide a direct proof. 

Lemma 2. Let t be a nonnegative integer, and let xi, . . . , Xt be nonzero elements of 
Tip, not necessarily distinct. Then the number of elements ofZ p that can be written 
as the sum of some subset (possibly empty) of the Xi is at least min{p, t + 1}. In 
particular, if t >p — 1, then every element o/Z p can be so written. 

We remark that the conclusion of the lemma is best possible, since we could take 
Xi = ■ ■ ■ = Xt ^ (mod p), in which case the set of elements of Z p that can 
be written as the sum of a subset of the Xi is precisely {0, x, . . . , tx}, which has 
cardinality t + 1 if t < p. 

Proof: We use induction on t, the case t = being trivial. Given a positive integer 
t and nonzero elements xi,...,Xt of Z p , let Si be the set of elements of Z p that 
can be written as the sum of some subset of x\, . . . , xf, then certainly Si C St and 
so \Si\ < \S t \. By induction we may assume that \S t -i\ > min{p, t}. If |<S f _i| = p 
then |5t| = p as well, so we may assume that p > > t. 

Suppose that |<5> t | < t + 1. Then we have \S t \ < t < \S t ~i\ < \S t \, and so 
\St\ — \St-i\ = t <p. In particular, the map / : S t -i — > S t defined by f(y) — y + x t 
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is actually a bijection of <5>t_i onto itself. Thus 

^2 y = ^2 (y + x t) = tx t + ^2 y( mod p)> 

yeSt-i yeSt-i 2/GSt-i 

which implies that txt = (mod p) , a contradiction since Xt is nonzero (mod p) 
and < t < p. Therefore |<5>t| >t + l, which establishes the lemma. □ 

Using this lemma, we can show that a factor of a prime p can be eliminated 
from the denominator of a rational number by adding the reciprocals of fewer than 
p integers from any prescribed set meeting certain conditions. 

Lemma 3. Let p l be a prime power, and let N be an integer that is exactly divisible 
by p\ Let c/d be a rational number with d dividing N , and let S be a set of integers 
dividing N that are all exactly divisible by p l . If \S\ > p — 1, then there is a subset 
T of S with cardinality less than p such that, if we define 

c! c 1 
d! d 2-i n 

neT 

with d I d! in lowest terms, then d! divides N/p. 

Proof: Without loss of generality we may assume that |«S| = p — 1, whereupon 
we denote the elements of S by m, . . . , n p -i- Let M = \cm{d, rii, ■ ■ ■ , n p _i}, let 
to = M/d, and let m, = M/rii for each 1 < % < p — 1. Note that M divides N and 
that each m, is nonzero (mod p) . By Lemma every element of Z p can be written 
as the sum of some subset of mi, . . . ,to p _i. In particular, we can choose distinct 
indices ii, ... , it for some < t < p — lso that 

nii 1 + • ■ • + m- H = —cm (mod p). (2) 

If we set T = {n^ , . . . ,n it }, then 



1 * 
c x - 1 cm \ ^ rriij 



(cm + m it H h m it )/p 



n M ^ M M/p 

n£T j=l IJ - 

where the numerator is an integer by virtue of equation (^). Since M divides N, 
this establishes the lemma. □ 

We now cite an existing Egyptian fraction algorithm, which we will use near the 
end of the proof of Theorem [y. 

Lemma 4. Let c/d be a positive rational number with d odd. Suppose that c/d < 
1/ P(d). Then there exists a set C of distinct odd positive integers, with 

max{n 6 C} -C d J^J p, 

p<P{d) 

such that c/d = X)nec 

Proof: Brcusch has shown that any positive rational number with odd denominator 
can be written as the sum of reciprocals of distinct odd positive integers. When 
one examines his construction Lemmas 1-3], one finds that when c/d < 1/ P(d), 
the integers involved in fact do not exceed 

51cm{d, 3 2 Y[ P}> 

3<p<P(d) 
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which implies the bound given in the statement of the lemma. □ 

3. Smooth Number Sets 

For real numbers x, y > 1, we recall the definition of A(x, y), the set of y-smooth 
numbers up to x: 

A(x, y) = {n < x : P(n) < y}. 

Let w > 1 and < A < 1 be real numbers and k > 2 be an integer. We will need to 
work with the following sets of smooth numbers with various specified properties: 

A(x, y;w, X) = {n : Xx < n < x; P{n) < y; n is fc-free; d 2 \ n =>- P(d) < w}, 

A(x, y; w, A; p l ) = {n S A(x, y;w,X) : n — mp l with P(m) < p}, 

Ao{x, y; w 7 A) = {n S A(x, y; w,X) : n is odd, not of the form m 2 + m — l}i(3) 

A (x, y; w, X;p l ) = A (x, y; w, A) n A(x, y; w, X;p l ). 

The first of these sets is the set of k-free smooth numbers that are squarefree with 
regard to large primes, as described in the introduction, while the second is the 
subset of the first consisting of those integers whose largest prime factor is p with 
multiplicity exactly I. The sets A(x, y; w, X;p l ) and Aq(x, y; w, A; p l ) do not actually 
depend on the parameter y as long as y > p; but we retain the parameter y for 
consistency of notation, and as a reminder that A(x, y; w, X;p l ) C A(x, y; w, A) and 
likewise for the .Ao-sets. 

We note that for any y' < y, we may write A{x, y; w, A) as the disjoint union 

A(x,y;w,X) = A(x,y';w,X) U ( [j A(x,y;w, X;p)^j U ( [j (J A{x, y; w, X;p 

y' <p<y y'<p<y i=1 

p>w p<w 

and the same is true if we replace every occurrence of A by Ao ■ 

As is customary, we let ^>(x,y) denote the cardinality of A(x,y). We will also 
use ty, with any list of arguments and with or without subscript, to denote the car- 
dinality of the corresponding ,4-set. It is well-known that ^(x, y) ~ p(\ogx/ \ogy)x 
for a certain range of x and y, where the Dickman rho-function p(u) is defined for 
u > to be the continuous solution to the differential-difference equation 

p(u) = l, 0<u<l; 

(4) 

up'{u) = —p(u — 1), u > 1. 

The following lemma, due to Hildebrand, describes this asymptotic formula more 
specifically. We have not made any effort to optimize the error term or the range 
of y in the hypothesis, as it will suffice for our purposes as stated. 

Lemma 5. Let x > 3 and y be real numbers satisfying exp(log2x) < y < x. Then 

Vlogy/V V logy 
Proof: Hildebrand shows B Theorem 1] that for any e > 0, one has 

' u log 1 u 



*(*,*"■) -m.)(i + o(^=)) 

uniformly for x > 3 and 1 < u < logx/(log 2 x) 5 ^ 3+s . On setting e = 7/3 and 
u = log xj logy (so that y = x 1 ^), the lemma follows immediately. □ 
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From the definition (0) of p(u), one can easily derive that p is a positive, de- 
creasing function, and that p(u) = 1 — logu for f < u < 2, so that p(2) = 1 — log 2 
in particular. We will need the following additional properties of p(u). 

Lemma 6. Let x > 3, v > 1, and y be real numbers satisfying v < x and 
exp(log2 x) < y < x. Then: 

/log /logz\ , / 



V logy / Vlogy/ 



/log a; 



(b) for any e > 0, we /icwe p( , 

Vlogy 

(c) for any real number a > 1, we have 



-l 



< x e 



£ »->(^)<<->&°->-" +1 ; 



logy / Vlogy 

log :c<n<:r x ' a 



/i \ .,, ( logy \ „ /logx/v^ flogx^ , ^logtdogjX 

(d) «/logu = " 



^),then P C^)=p(p^)(l + 0( ] - 
Vlogax/ V logy / Vlogy/ V V logy 

Proof: Hildebrand shows H Lemma l(vi)] that for any < t < u, we have 

p{u-t) <p(u)(ulog?u)*. (5) 

Part (a) follows immediately on taking u = log x/ logy and i = log vj logy and 
noting that wlog^u < logx. Part (b) follows from part (a) on taking v = x and 
noting that 

By using part (a) again, the sum in part (c) is 



n -a+a/ \ogl x 



«p(^) E ^K) log ^ /lo -«p(f^) E 

Vlogy/ ^ v ; r Vlogy/ 4^ r 

toy lo S T<n<ii/« by n>loga; (6) 

For x sufficiently large, the exponent —a + a/log^x is bounded above away from 
— 1. Therefore the right hand side of (0) is 



/logx\., 

«p r^- logs 

Vlogy/ 



a+a/ log^z+l 



=O (l06 * r " +I ^ (a/l0 ^ )<< ' > © 

which establishes part (c). 

Finally, for any real numbers 1 < s < t, we have 

p(s) - pit) = - f p'iu) du = [ du 

Js Js U 

^ Pp - 1) u _ _/^/. 1 __3.M/'*-* > 



'(t-sXp^^log^) 1 ^) 



S V s 

by equation (||). Therefore 

p(i)=p( S )(l + 0((<-,s)log? S )). 

Letting s = log(x/w)/ logy and t = logx/ logy, we see that 

/logxx = /logx/.x / + Q /bg^log|xxv 
Vlogy/ V logy /V V logy // 
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and under the hypothesis that logf = o(log y/ log2 x), the (1 + 0(-))-term can be 
inverted and moved to the other side of the equation, which establishes part (d). 

□ 

With Lemmas |^ and |^ at our disposal, we can establish the following lemmas 
concerning the distributions of the sets of smooth numbers defined in equation ([}]) . 

Lemma 7. Let x > 3, y, and w be real numbers satisfying exp(log2 x) < y < x 
and w > logx. Then: 



(b) *o(x, 9! »,0) = ^(|2££)(l + o(!j^)), where Hk) = 2(l-2-*)«t). 



Proof: From the definition (||) of A(x, y; w, A), we have 

x-H(x,y;w,0) = x~ l £ ( ]T M (d)V ]T M (/)Y 

n<z ^ •>>' / V ,2, 



d"\n f\n 
P(n)<y p( d )<w pU)>™ 

Interchanging the order of summation yields 
x- 1 K(x,y;w,0) = x- 1 £ £ M (/) £ 1 

d<a: 1/fc f<\fx~dF k ~ n < x 

P(d)<min{y.w} P(f)<y P(n)<V 
p(f)>w d k \n 
f 2 \n 

d<a; 1/fc f<Vxd~ k 
P(d)<min{y,w} P(f)<V 
p(f)>w 

- E E ^(^(ho^ 

<i<:r 1/ ' c f<Vxd- k 
P(d)<min{y,w} P(f)<y 
p(f)>w 

where the final equality follows from Lemma ^. The primary contribution to the 
double sum will come from those terms with d small and / = 1 (notice that if / > 1 
then / > w > logx), so we write it as 



logy / V V logy 

a<log x 

+< E -v(^) + X>"> E rv(^)). m 

log x^d^a: 1 /^ eZ— 1 log x<f<.y/x 
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The two final terms can be estimated by Lemma |s](c) 

gxd~ k \ / logan 
logy / P Vlogy/ 



V d p[— )<p log a;, 

^ V logy y VIoku/ 



log x<d<a; 1 / fc 
00 . p-2 



E E rv(^) « ' * E <- - * 

d=l logx</<VS <*=1 
Also, for d < \ogx, Lemma ||(d) gives us 

/logzcTS /logjrw f (logflog^log 2 ,^ 

Thus equation (7) becomes 

x- 1 * (a;;y;u; ,o) = p(^)((l + o(^)) £ K^ + OCOogarJ-^+log- 1 : 
On noting that 



2 //(d)d- fc = 5>(d)<r fc + of 2 d- fc j =c(fe)- 1 + o(io g - fc+1 a; ), 

c£<loga; d— 1 d>loga; 

part (a) of the lemma is established. 

Let A\(x, y; w, 0) and Az{x, y; w, 0) denote the odd and even elements of A{x, y; w, 0), 
respectively. For a set S and an integer n, write n ■ S = {ns : s € S} and note that 
\n ■ S\ = \S\. We see that 

2 • „4(x/2, y; w, 0) = {2n : n < x/2; n is fc-free; P(n) < y; d 2 \ n P(d) < w} 

= {n : n < x; n is even and fc-free; P(n) < y; d 2 | n P(d) < w} 

U {2 fe n : n < x/2 k ; n is odd and fc-free; P(n) < y; d 2 \ n =4> P(d) < w} 
= A 2 (x, y; w, 0) U 2 k ■ A x (x/2 k , y; w, 0) 
as a disjoint union. Taking cardinalities on both sides, we see that 
*0/2, y; w, 0) = * 2 (z, 2/; u>, 0) + *i(x/2 k ,y; w, 0), 

or equivalently 

¥i(jc, y; w, 0) - V 1 (x/2 k ,y; w, 0) = $>{x, y; w, 0) - *(x/2, y; w, 0), 
since \& = + ^2- Consequently, for any nonnegative integer to, we have 

m 

*i(ar, y; tu, 0) - * 1 (x/2( m + 1 ) fe I y; W , 0) = £ (^(ar/a* y; w, 0) - * 1 (or/2«+ 1 ) fe , y; «;, 0)) 

i=o 

m 

= J2 (*(^/2 jfe , y; «>, 0) - *(x/V k +\y; w, 0)) .( 8 ) 
i=o 

Choose to = Llog 2 x/(fclog2)J, so that 2 mk < logx < 2(" l+1 ) fe . For any < I < 
(to + l)fc, part (a) gives us 

, W2 ,, s; „, 0) ._£_ K « )(l + o( ^ )) 

- (> + " C^'s;" 08 - 1 )) (> - °(^)) 
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by Lemma ^(d) and the choice of m. Using this in equation (0) gives us 



* 1 («,y;«, l 0) = g - -_£_) p (^£) (i + o 



logy 



O(#(:c/2( m+1 ) fc ,y;u;,0)) 



2£(k) Vlogy/V V logy 



i=0 

^3 , 



2C(fc) Vlogy/V V logy 
' log a; 



V log cc V log y / / 
x /logxw n /logJx\\ 



Finally, the number of integers not exceeding x of the form m 2 + m — 1 is at 
most y^E. Therefore, 

* (a;,y;w;,0) = Vx(x, y; w, 0) + 0{y/x) 



£(fc) Vlogy/V V logy Vlogy/ 

Since the second error term is <C x~ x / 2+£ by Lemma [|(b), it is dominated by the 
first error term, and so part (b) is established. □ 

Lemma 8. Let k > 2 be an integer and < e < 1/k and < A < 1 be real 
numbers. Let x be a sufficiently large real number, and let y and w be real numbers 
and p a prime satisfying log a; < w < e )/ fe and exp(log2x) < p < y < x^ £ ^ 2 . 
Let I < k a positive integer, with I — 1 if p > w. Then: 



(a)(tfesi „,^ (1 _^ KS £ )(l + o(^)) ; 



C(fc) Vlogy/V V logy 

(b) ,„ (l , B „, A) _ & _^,(g£)( 1 + (g£)),. 

(c) V(x,y;w, X;p l ) > p, and if p ^ 2, then ^ {x,y;w, \;p l ) > p; 

(d) there exists an element of A(x,y;w, A) that is exactly divisible by 2 l (for any 
1 < I < k). 

Proof: Clearly ^(x, y; w, A) = ^(x, y; u>, 0) — ^(Ax, y; w, 0) and similarly for the 
^o, and so parts (a) and (b) follow from Lemma fj] on noting that 

' log Xx\ ( log x \ / ( log2 x N 



p(^)=p(P)(i + o(^)) 

V logy / VWw/ V V log?/ // 



3gy / uogy/ v v logy 

by Lemma ^(d) with v — A -1 . 

We notice that mp' € ^4(x, y; w, A;p') if and only if m € A(xp~ l , p — 1; w, A), and 

so 

- ^ P -^,A)=p(^)(i^(i +0 ( i ^; 
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by part (a). To show that this is greater than p, it suffices to show that 
((k)p l+1 ( logxp- 1 \~i / / log^x \\ 

But under the restrictions on p, I, y, and w, we have that p l+1 < £ 1_E ; and by 
Lemma ^|(b) we have that p(\ogxp~ 1 / log(p — 1)) _1 < p(log x/ log(p — <C 
x £ / 2 . Therefore, the left-hand side of equation (j|) is -C x 1_£ / 2 . This establishes 
the first assertion of part (c), and the proof of the second assertion is similar. 

Finally, by part (b) with x replaced by x/2 l , we see that ^ (x/2 l ,y; w, A) is posi- 
tive for sufficiently large x. But then we can choose an element m of Ao(x/2 l , y; w, A), 
and then 2 l m is an element of A(x,y;w, A) that is exactly divisible by 2', which 
establishes part (d). □ 

Lemma 9. Under the hypotheses of Lemma |^, we have 

( c ) if U — y( x ) * s chosen so that log a;/ logy < 5 /or some constant B depending 
only on k, then for any positive real number a, there exists a real number 
< A < 1, bounded away from zero uniformly in x, such that when x is 
sufficiently large, we have 

0<a- Y n- 1 < x- 1 exp(Da) (10) 

for some constant D depending only on k. 
When we apply part (c) we will need only that the middle expression in equation 
( flO| ) is positive and <C x^ 1 , but we have been explicit about the upper bound for 
two reasons. First, since we will use part (c) to choose a value of A, it is important 
that the implicit constant not depend on A. Second, we will apply part (c) with 
a very small value of a, and one that is specified only up to an error bounded by 
a function of x. From the form of the upper bound in equation (|l^), we will be 
assured that this will not inflate the implicit constant as x grows. 

Proof: We have 



C(fc) ]T n-^ ((h) f r 1 d*(t,y,w,0) 



nE<A{x,y;w ) X) 



C(*)t _1 *(*,»;«>,o)|L + C(fc) / *(t,v,w,o)% 



\x t 2 (11) 



/ nogxy AogXx, r P (^l) d J-)(l + 

V V1 °gy y v !ogy ' J\x v iogy/ t j v 



log 2 a; 



logy 

by LemmaWa). However, since logA -1 = 0(1) = o(logy/log2 x), we can apply 
Lemma ^(d) to see that 

p(r 5i )=p(r 5f )(i + o(^)) 

Vlogy/ Vlogy/V V logy // 
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uniformly for Xx < t < x. Therefore equation ( fLl| ) becomes 



3 



pf^logA-fl + O^ 
V log y / V 



yj V V logy 

This establishes part (a), and the proof of part (b) is exactly similar. 

As for part (c), note that the function a — Y^ n eA (x yw t) n ^ s an increasing 
function of t, with jump discontinuities of size not exceeding (ix) -1 , and which 
takes negative values if x is sufficiently large. Therefore we can choose A such that 

0<a- n -1 < (As) -1 . (12) 

n£Ao(x,y;w,\) 

For this value of A, part (b) tells us that 

a> ^ ; =^rHio^)( 1+0 (fog7 



which implies that 



p(B)\ V logy 

This shows that A is bounded away from zero uniformly in x; and combining this 



A 1 < exp 1 + 0' 



bound with equation (h2|) and writing D = 2£(k)/p(B) establishes equation (10) 



for sufficiently large x. □ 

4. Construction of Dense Egyptian Fractions 

We now proceed with the proof of Theorem [|. Let r = a/b be a positive rational 
number and rj a positive real number. For the first stage of the proof, let < e < 1/2 
and < 5 < r be real numbers and k > 2 an integer that is large enough to ensure 
that b is /c-free; we will later constrain these parameters in terms of r and 77. Set 

Let x > 3 be a sufficiently large real number, set y = a^ 1 " 6 )/ 2 and w = x^ 1 ~ e ^' k , 
and for z > 2 define 



j)(z)=%,)=(nH( n 

N TXT" K ' Sii/n.-'r ' 



p<2 7 x w<p<z 

p<w 



D (z) = 2-^D(z), 



so that Do(z) is the odd part of D(z). (We understand that the second product in 
the definition of D(z) is 1 if z < w.) Notice that if p is a prime, then p does not 
divide D(p). 

Let pi > P2 > ■ ■ ■ > Pr. be the primes in (w,y\, and for later consistency of 
notation, let po be the smallest prime exceeding y. Notice that all elements of 
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A(x,y;w,0) divide D(p ), as does b as long as P(b) < w; we henceforth assume 
that x is large enough to ensure this. Define 

ao _ _ \ - 1 
bo 2— 1 n 

with ao/bo in lowest terms. Notice that 60 divides D(pq) and that 

?=*+0(^) (14) 
6 V logy / 

by Lemma ^(a) and the choice (|ll|) of A. 

We now recursively construct a sequence of fractions ax/b\, a 2 /b 2 , ■ ■ ■ , an/bn 
with the following two properties. First, each di/bi is obtained from the previous 
a-i-i/bi-i by adding the reciprocals of a few elements of A(x, y; w, A), specifically 
elements of A(x, y; w, \;Pi); so when we have constructed these fractions, we will 
have written r as the sum of an/bn and the reciprocals of almost all of the elements 
of A(x, y, w, A). Second, all of the primes dividing each denominator bi will be less 
than pi , so the denominators are becoming gradually smoother. 

Formally, we construct the fractions di/bi as follows. Given aj_i/6j_i, where 
1 < i < R, we apply Lemma || with p = pi, I = 1, N = D(pj_i), c/d — di-x/h-x, 
and S — A(x,y;w, X;pi). This gives us a subset of A(x, y; w, A;pi), which we call 
Bi, with cardinality less than pt such that, if we define 



E 1 



bi h-i .. 

with di/bi in lowest terms, then bi divides D(pi-i)/pi — D(pt). We are justified 
in applying Lemma [| with these parameters since, by Lemma ||(c), the size of 
A(x,y;w, X;pi) is at least pi. 

Now set y' — exp(log2 x), and let qi > q 2 > ■ ■ ■ > qs be the primes in [y', w); and 
for later consistency of notation, set go = PR an d ao,fe/&o,fc = <Zr/6r. We recursively 
construct another sequence of fractions ^1,1/61,1, 01,2/^1,21 • ■ ■ 1 a x,k/bx,k-> a 2, 1/^2,1, 
• • ■ 1 a S,k/bs,k with similar properties to the first sequence, the differences coming 
from the fact that the primes qi may potentially divide the denominators we are 
working with to the (fc — l)st power. First, each di^/b^x is equal to di-x,k/bi-x,k, 
and each (Hj/bij is obtained from the previous dij-x/bij-x by adding the recipro- 
cals of a few elements of A(x, y; w, A), specifically elements of A(x, y; w, A; <zf 
again, we will have written r as be the sum of as t k/bs,k and the reciprocals of 
almost all of the elements of A(x, y; w, A). Second, all of the primes dividing each 
denominator b$ j will be less than or equal to qi, and qi itself will divide bij at most 
to the (k — j + l)st power, so the denominators are becoming gradually smoother 
still. 

Formally, we construct the fractions di_j/bi_j as follows. First, given di-x,k/bi—x,ki 
where 1 < i < S, we set a^x/b^x = <H-x,k/bi-x,k- Then, given aij—x/bij-i, where 
2 < j < k, we apply Lemma || with p = <7j, / = fc — j + 1, iV = 
c/d = dij-x/bij-x, and 5 = A(x, y; u>, A; ^ This gives us a subset of 
^4(x, y; w, A; qf - ^ 1 ), which we call Bij, with cardinality less than qi such that, 
if we define 

a, w a,- „•_ 1 1 — > 1 
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with a-ij/bij in lowest terms, then bij divides q^~^ D(qi). Again, Lemma ^(c) 
justifies our use of Lemma || with these parameters. We also note that D{qi^i) = 
<7j -1 D(qi), and so our convention that Oi— i,fc/6t-i,]fe = cii,i/6i,i is consistent with 
the divisibility property of the denominators. 

We have gradually eliminated all prime factors not less than y' from the de- 
nominators of the fractions a-ij/bij] to finish the first stage of the proof, we 
eliminate powers of two as well. We set as+i,i/bs+i.i = as,k/bs,k and define a 
sequence of fractions 05+1,2/65+1,2, 05+1,3/65+1,3, ••• , Os+i,fc/&S+i,fe, such that 
each as+i,j /bs+ij — as+i,j-i/bs+i,j-i is either zero or else is the reciprocal of 
an element of A{x, y'\ w, A), and such that powers of two dividing the bs+ij are 
diminishing. Given ag+ij— i/bs+x,j— 1, where 2 < j < fc, if 2 /£ ~ : ' +:L exactly divides 
65+1,3-1 then we invoke Lemma ||(d) to choose an element rij G y'; w, A) that 
is exactly divisible by 2 /c ~- J+1 and set as+i,j /bs+ij = as+ij-i/bs+ij-i + 1/n.j 
and Bs+ij = {n<j}- Otherwise, we set as+ij /bs+ij — as+i,j-i/bs+i,j-i and 
Bs+i,j = 0- It is easy to see inductively that with this construction, bs+ij divides 
2 k -iD (y') for each 1 < j < k. 

Define 

/ R s. /5+1 k s 

B = U B U U U B « ■ (15) 

^ i=l ' ^ i=l j=2 ' 

We note the largest prime factor of each element of a Bi is Pi, and the largest prime 
factor of each element of a Bi^ is qi, except that the largest prime factor of each 
element of a Bs+ij is less than y' . Furthermore, for a fixed i < S, the various 
Bij have elements which are divisible by qi to different powers; and the elements 
of the various Bs+ij are divisible by different powers of two. We conclude that the 
union in the definition ( |l5| ) of B is actually a disjoint union, and thus if we define 
A = A(x, y; w, A) \ B, then we have written 

as+i,k . 1 



(16) 



with bs+i.k dividing Dq {y')- Moreover, the cardinality of B is bounded by 

(p-l) + (fc-i) £ (p-i) + (fc-i)«y 2 -^- e , 

w<p<y y '<p<w 

so that the cardinality of A is 

\A\>\A(x : y;w,X)\-0(x^) 

^->mKi^)( 1+ <0 (17) 



by Lemma ||(a) . We also note that 

^±M^ + Vl = (<5 + o(^)) + 0( IBI(Ax)- 1 ) = 5 + o(^) 

by equation ( fl4| ) and the fact that \B\ -C 

From the choice ( p"g| ) of A = A(e, 5, fc) and the continuity of p(u), we see that 

1 - A / 2 \ / / -r V 



hm 

£ 

A' 



3 u/^(T^) = ( 1 - exp (K2T))^ 2 ) = c7 « 
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We thus choose e and S sufficiently small and k sufficiently large, in terms of r and 
77, to ensure that (1 — A)p(2/(1 — e))/£(fc) > C(r) — 77; then, from equation (|l7|), 
we see that |.4| > (C(r) — Ty)cc for sufficiently large x, (We note that now x needs 
only to be sufficiently large in terms of r and 77, since 5, e, and k, and thus A, have 
all been chosen in terms of r and r/.) Moreover, all elements of A are certainly 
greater than Ax. Therefore, to establish Theorem |l|, it suffices by equation (|l^) to 
write as+i,k/bs+i,k as the sum of reciprocals of distinct integers not exceeding Xx, 
without regard to the number of terms in the representation. This is the goal of 
the second stage of the proof. 

We begin by applying to as+i,k/ b S+i,k much the same process that we applied 
to r in the first stage of the proof. Recall that y' = exp(log2 x), and set x' = (y') 2k , 
w' = y', and y" = (logx)/3fc. Let q[ > q' 2 > ■ ■ ■ > q' T be the primes in [y",y') : 
and for later consistency of notation, set q' — qs- Since log x' / logy' = 2k and 
exp(log2x') < y" < y' = w' < (x')' 1_e ^ fc for x sufficiently large, we may freely 
appeal to Lemmas |^ and ^] with the parameters x' , y' , and w' and any prime q[. In 
particular, by Lemma ^(c), we may choose A' so that 

0-S+l,k -1 ^1 r\-l ( nn \ a S+l,k* 



< 

Define 



£ <{xr 1 ^(D{k)f^-). (19) 

* — ' . V 6,94-1 .k / 



>S+l,k _ . , , , , .,, v os+l,k ■ 

n£ A (x ,y ';y ',A ') 



a 0,fc _ «s+i,fc _ _1 

with a' ok /b' ok in lowest terms, and notice that b' k divides D (y'). With this 
definition and the estimate ([l^) on the size of <2s+i,k/ b S+i,k> equation (19) becomes 



0< ^ <(ir . exp ( I)W ( i + o(^)))« (l0 - 



We recursively construct a sequence of fractions <4 1/^1 ij a i 2/^1 21 • ■ ■ ) a 'i k/ b 'i fc> 
a 2.i/^2.i: ■ • • 1 a T,k/ b Tk as follows. First, given a' i _ 1 k /b' i _ 1 k , where 1 < i < T, we 
set a 'i.i/ b i,i = a 'i-i,k/ b 'i-i,k- Thcn ' S ivcn a 'i,j-i/ b i,j-i> where 2 < j < k, we apply 
Lemma I with p = q<, I = k - j + 1, N = (q<) k - j+1 D(q<), c/d = a'^^/b'^^, and 
S = A{x',y';y',X';(ql) k - j+1 ). This gives us a subset of A{x', y'; y', A'; (^) k ~ j+1 ), 
which we call /S,' , with cardinality less than q\ such that, if we define 



H 1 



with in lowest terms, then &^ . divides (^) fe_J D(^). 

Define 



i=l j=2 



and notice that this is a disjoint union by the reasoning following equation (15) 
earlier. Thus, if we define A' = Aq{x' , y'\ y' , A') \ B' , we see that 
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with b' T k dividing Do(y"); in particular, P(b' Tk ) < y" . Moreover, 

o<^ = ^ i +E 1 « vr 1 + ia'kx'xt 1 « (?/) vr 1 = oiiv")- 1 ), 

b T,k °0,k neA , n 

and so a' Tk /b' Tk < (P(b' T fc )) _1 for x sufficiently large. Since b' T k divides D (y"), 
it is odd, and so we can apply Lemma |] to find a set C of positive odd integers such 
that a'/b' T k = J2nec V n and 



max{n eC}« b' T k J| p < J\ p k = expf k E logp 

p<P(b' Tk ) p<y" ^ p<y" 

Chebyshev's bound for Tr(t) implies that there is a real number c < 2 such that 
J2 p <t l°gP < c ^ f° r an y positive t. Therefore 

max{n eC}« exp(fccy") < x 2/3 . 

We have almost achieved our goal for the second stage of the proof, for as+\ t k/bs+\,k 
TlineA' l/ n + Tlmec V n ; Du t -A' and C might not be disjoint. Define 

Vx = {n + l:neA'nC} and V 2 = {n(n+ 1) : n G .A' n C}. 

Since 1/n = l/(n+ 1) + l/(n(n + 1)), we have 

Qg+i,fc _ j_ j_ j_ 

° + ' r»S>t' neC\.4' nGA'nC 

11 1 1 ^ 

= £- + E -+ E -+ E 

neA' nEC\A' neVx neV 2 

and we claim that this is a disjoint representation of as+\k/bs+ik using denomi- 
nators less than Xx. We already know that max{n G A'} < x 1 and max{n gC}< 
x 2 / 3 , and we easily see that 

max{n G V x U X> 2 } < (max{n G A'}) 2 < (a/) 2 , 

so that the integers involved are of admissible size. 

Clearly A' and C \ A' are disjoint; and since the elements of A 1 and C are 
odd, those of T>\ and 2?2 are even, and so each of the first two is disjoint from 
each of the last two. Finally, if there were an element n in T)\ n X> 2 , then there 
would exist mi, m 2 £ A' DC such that n = mi + 1 = 7712(7712 + 1). But then 
mi G A' C Ao(x' , y'', y' ' , A') satisfies mi = m\ + mi — 1, contradicting the definition 
of an Ao-set. Therefore the four sets in the representation (^lj) are indeed disjoint, 
and Theorem [l| is established. 

5. Prospects for Improved Results 

Clearly the only barrier to establishing Theorem [j] in best-possible form is the 
presence of the factors 1 — log 2 in the expression for C(r). This quantity arises 
as p(2), which comes from using integers of size x that are roughly x^-smooth, 
which in turn comes from the necessity that we have at least p — 1 multiples of 
every prime p at our disposal in order to invoke Lemma || and thus Lemma ^. 

As noted in Section |[ the conclusion of Lemma ^ is best possible, since we can 
arrange for many sums of subsets to coincide. For randomly chosen sets, however, 
this behavior is very unlikely. Heuristically, a set of random nonzero elements of 
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Zip should have a subset summing to a randomly chosen target element of Z p as 
soon as the size of the set is as large as a small power of log p. We might expect, 
then, that if we were to attempt the construction in the proof of Theorem [j] using 
integers that were x 1_e -smooth instead of ce( 1_e " 2 -smooth, then we would have at 
least p £ multiples of each prime p to choose from, and almost always we could find 
a small subset of those multiples to exclude to force the necessary cancellation of 
factors of p. If one could show that this were the case, the factors tending to p(2) 
could be replaced by factors tending to p(l) = 1, and we would have established 
the best possible result. 

Number theorists have also investigated whether positive rationals have Egyptian 
fraction representations of various specified forms. For instance, we have seen in 
Lemma |] that any positive rational with odd denominator can be written as the 
sum of reciprocals of distinct odd positive integers (clearly no such representation 
exists for positive rationals whose denominator in reduced form is even, since the 
common denominator of such a representation will necessarily be odd). Graham 
has proven a very general theorem showing that, for a certain class of subsets Z of 
the positive integers, any positive rational can be written as an Egyptian fraction 
with denominators restricted to elements of Z, provided only that it satisfies to 
clearly necessary assumptions. Its denominator must not be divisible by any primes 
that don't divide any element of Z, as in the case of odd denominators discussed 
above; and its size must be compatible with the sizes of the finite subsums of the 
reciprocals of Z — the series of reciprocals of Z might converge, for instance, in 
which case large rationals could never be so represented. 

It does not seem implausible, therefore, that any positive rational with odd de- 
nominator has an Egyptian fraction representation consisting of 3> x unit fractions 
all of whose denominators are odd and at most x; or that a positive rational meeting 
the local conditions prescribed by Z can be written as an Egyptian fraction using 
an asymptotically positive proportion of the elements of Z. Furthermore, one might 
even believe that the best-possible bound for that proportion, which can be derived 
from the function J2nez \x<n<x V n as we did m ec l ua tion (|l]) for Z = Z, is in fact 
attainable. 
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